skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Maxwell, Michael"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. This paper describes on-going work in dictionary digitization, and in particular the processing of OCRed text into a structured lexicon. In processing the output of an OCR engine into structured data, we are faced with three problems: 1. Typographic errors; 2. Conversion from the dictionary’s visual layout into a lexicographically structured computer-readable format, such as XML; and 3. Converting each dictionary’s idiosyncratic structure into some standard tagging system. This paper deals mostly with the second issue, but touches on the first and third. 
    more » « less